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Abstract 

We use the Stein-Chen method to study the extremal behaviour of univariate and bivariate 
geometric laws. We obtain a rate for the convergence, to the Gumbel distribution, of the law of 
the maximum of i.i.d. geometric random variables, and show that convergence is faster when 
approximating by a discretised Gumbel. We similarly find a rate of convergence for the law of 
maxima of bivariate Marshall-Olkin geometric random pairs when approximating by a discrete 
limit law. We introduce marked point processes of exceedances (MPPEs), both with univariate 
and bivariate Marshall-Olkin geometric variables as marks and determine bounds on the error 
of the approximation, in an appropriate probability metric, of the law of the MPPE by that of 
a Poisson process with same mean measure. We then approximate by another Poisson process 
with an easier-to-use mean measure and estimate the error of this additional approximation. 
This work contains and extends results contained in the second author's PhD thesis (Feidt 
(2013)) imder the supervision of Andrew D. Barbour. 


1 Introduction 

The problem of determining the behavior of extremes of random variables is a mathematically 
intriguing problem. Already the case of the maximum or minimum of a sample of n indepen¬ 
dent and identically distributed (i.i.d.) random variables Xi,..., presents interesting aspects, 
starting from the existence of a domain of attraction under suitable conditions (Fisher and Tippett 
(1928)). In this work it is also highlighted that for discrete distributions for which the jump heights 
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continue to be too large, no non-degenerate limit distribution may be found for the maxi¬ 
mum of the sample (Leadbetter et al. (1983), Theorem 1.7.13). Well-known examples are the geo¬ 
metric and Poisson distributions (Leadbetter et al., 1983, Examples 1.7.14, 1.7.15). There is, how¬ 
ever, a way to partially remedy this. By allowing the distributional parameter (or one of them) 
to vary with the sample size n at a suitable rate, Anderson et al. (1997) and Mitov and Nadarajah 
(2002) determined extremal limit laws for the Poisson and geometric distributions, respectively. 
In this paper, we will concentrate on the geometric distribution. We will consider both univariate 
and bivariate geometric random variables, as well as point processes with geometric marks, again 
both univariate and bivariate. 

Using a result from the Stein-Chen method for Poisson approximation by Barbour and Hall 
(1984), contained in our work in Theorem 2.5, we determine bounds on the error, in the Kol¬ 
mogorov distance, of the approximation of the maximum, under different normalisations, of i.i.d. 
geometric random variables by the Gumbel distribution. Our results show that convergence is 
faster and requires no conditions on the distributional parameter when approximating by a dis- 
cretised Gumbel distribution. We similarly determine an error bound for the approximation of the 
joint law of maxima of random pairs, following the bivariate Marshall-Olkin geometric distribu¬ 
tion, by an appropriate discrete limit law. 

We further use the Stein-Ghen method for Poisson process approximation to determine bounds 
on the error, in a suitable probability metric, of the law of a marked point process of exceedances 
(MPPE), defined by 

n 

'^u,n'-='^I{Xi>u}SXi, ( 1 . 0 . 1 ) 

i=l 

by that of a Poisson process whose mean measure equals that of the MPPE, where the A, are i.i.d. 
geometric random variables and u denotes a threshold, and 5z denotes the Dirac measure for a 
point z ^ E, the state space of the X/s. Though the MPPE does not mark the points that exceed u 
in the way that a point process of exceedances (PPE) of the form does, it contains 

more information relevant to the study of extreme values than a marked point process (MPP) of 
the form as it is not only a random configuration of points in space, but specifically a 

random configuration of points exceeding a threshold. A related approach is that of empirical point 
processes of exceedances (EPPEs), which have the form d{(j/n,/-i(Xi))eA} for A a Borel set of 
[0, 1] X [0, -l-oo) and / : [0, -|-oo) —^ M a strictly decreasing function. Eor more details on the study 
of these processes, we refer to Leadbetter et al. (1983), Resnick (1987), Barbour et al. (2002). In 
addition to an MPPE with univariate geometric marks, we consider an MPPE with bivariate marks 
that follow the Marshall-Olkin geometric distribution, for which the MPPE indicates whether they 
lie in a subset A of extreme values of the marks' state space. Eor both cases, the estimate for the 
actual "Poisson approximation" comes easily. The reason for this is that we use i.i.d. samples 
Al,..., Xn and i.i.d. indicators If^XieAjj • • • > ^{x„gA}- This allows us to apply Theorem 2.5, which 
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reduces the problem to the approximation of a binomial by a Poisson distribution. However, as the 
marks have geometric, and thereby discrete margins, the mean measure will live on a lattice and 
be rather tedious to work with in practial applications. We would therefore prefer to approximate 
by a further Poisson process with a continuous mean measure. Since the total variation distance 
is too strong for this kind of approximation, we use the weaker (i 2 -distance instead, which is not 
as sensitive towards small changes in the positions of the points of the point processes. Our main 
effort thus lies in determining error bounds on the approximation of a Poisson process by another 
Poisson process. As the error given by Theorem 2.5 is only P{X G A), the error obtained by 
further approximating by a different Poisson process is typically the bigger of the two, both in the 
univariate and bivariate case. 

For the MPPE with univariate geometric marks, we specifically add, to the first approximation 
by a Poisson process with mean measure equal to that of the MPPE, a further approximation by a 
Poisson process with continuous intensity function. For the MPPE with bivariate marks, we pro¬ 
ceed by "spreading out" the point probabilities of the Marshall-Olkin distribution over the entire 
space. As the intensity function obtained through this depends on n, we make some additional 
assumptions on the parameters of the Marshall-Olkin geometric distribution and further approx¬ 
imate by a Poisson process with a "continuous" mean measure independent of n. By adding up 
the ^ 2 -error bounds arising from each step, we give the total error bound for the approximation of 
the MPPE by the final Poisson process. 

The structure of the paper is as follows: in Section 2 we recall necessary basic definitions as well 
as results from the Stein-Chen method. In Section 3 we treat maxima of univariate geometric ran¬ 
dom variables, as well as joint maxima of random pairs that follow the bivariate Marshall-Olkin 
geometric distribution. In Section 4 we first study the MPPE with univariate geometric marks, 
and then the various steps involved in approximating the MPPE with bivariate geometric marks 
by a Poisson process with "continuous" mean measure independent of n. 

2 Background 

Throughout, let be a locally compact separable metric space. In later applications, we simply 
use E C W^, d > 1. Let E be equipped with its Borel ci-algebra £ := B{E). Suppose that ^ is 
an integer-valued Radon measure on £. Denote by Mp{E) the space of all such point measures 
^ on E and equip Mp{E) with the ci-algebra Aip{E) that is the smallest cr-algebra making the 
evaluation maps ^ —)■ ^{B) from Mp{E) to [0, oo] measurable for any sef B £ £. Similarly, denote 
by Mp{E) c Mp{E) the space of all finite point measures and equip Mp{E) with the ci-algebra 
Mp{E) ■.= Mp{E)nMp{E). 

Let {£l,E,P) be a probability space. We define a point process on EJ by S : {£l,P,P) —> 
{Mp{E),M.p{E)), u) 1 -^ S(a;) = ^ and denote its intensity measure or mean measure on £ by A. Fur- 
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thermore, we denote the law of Poisson point processes E on E with mean measure A by PRM(A). 
For a general review of point processes, we refer to Resnick (1987). 


2.1 Distance between measures 


We recall here some known facts on a probability metric, the d 2 -Tne\x\c, that is weaker than the 
total variation metric, defined for example in (Barbour and Brown, 1992, Section 3). Let do be a 
metric on E that is bounded by 1. We now define metrics on both the space Mp{E) of finite point 
measures over E and on the set of probability measures over Mp{E). Let JC denote the set of 
functions k : i? —)■ M such that 


Si{k) 


sup 

ziy^Z2&E 


k{zi) - n{z2)\ 
do{zi,Z2) 


< + 00 , 


which implies that for all zi ^ Z2 & E, \n{zi) — k,{z2)\ < si{K)dQ{zi, Z2)- Thus each function k £ 1C 
is Lipschitz continuous with constant si(k). Define a distance di between two finite measures p 
and <T over E by 


dl(p,cr) 


/ 

1 , 

J_ \Ie - Je 

, P{E) kgk si{k) 


iip{E)^cT{E), 
if p{E) = a{E) / 0. 


( 2 . 1 . 1 ) 


Note that di is bounded by 1. We can use di as distance between point measures in Mp{E). The 
di-distance is then a Wasserstein metric induced by do over point measures on E. 

We next construct a metric d 2 that is a Wasserstein metric induced by di over probability mea¬ 
sures on Mp{E). Let PL denote the set of functions h ; Mp{E) -£■ M such that 


S2{h) 


sup 


mi)-h{i 2 )\ 

di (6 >6) 


< oo, 


( 2 . 1 . 2 ) 


i.e. each function h £ PL is Lipschitz continuous with constant S 2 {h). We define a distance d 2 
between probability measures p and v over Mp{E) by 


d2{p,v) = 


—— sup 
S2[h) 


/ hdp — / hdu 

'Mp{E) JMpiE) 


(2.1.3) 


Note that d 2 is bounded by 1. It can readily be shown that d 2 (p, u) < dTv{p, e). 


2.2 The Stein-Chen method 

We briefly recall the theory of the Stein-Chen method, which was first worked out by Chen (1975). 
Let Z be a random variable with law p. A characterising operator for p is an operator Ap on some 
class of functions E such that, for any random variable X, 

E[Apf{X)] = 0 'Pf£E iffX-p. 
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The idea of the method is that IE [A^f{X)] determines the "distance" between Z and X. To de¬ 
velop this idea, given a function h £ T-L (for example, indicator functions of intervals), we need 
to find a function fh such that A^fh{x) = h{x) — 'E[h{Z)\. Hence this yields \E[Af^fh{X)]\ = 
|E [/i(X)] — E [/i(^)]| . If the left-hand side is small, then |E [/i(X)] — E [/i(E’)]| is also small. A de¬ 
tailed description of the Stein-Chen method can be found, other than in the original article, in 
Barbour (1997). We recall also the following results to keep the paper self-contained: let Z = 
{Zt,t G M+} be an immigration-death process on E with immigration intensity and unit per- 
capita death rate. For any bounded h £%, let the function y : Mp{E) —^ M be given by 


y(6 


|E«/i(Zt) - PRM(A)(/i)| dt. 


Then 


Proposition 2.1. y(^) is well defined, and |y(OI < oofor each k G Z+. 

Lemma 2.2. if 7 is defined as above for h : Mp{E) — > M any function in "H, define 

Aiy= sup |y(f+ 4)-y(6l • 

^&Mp{E),z(SE 


Then 


Aiy < sfih) min < 1 , 


1.65 

71 

Proposition 2.3. The function y satisfies the Stein equation 

(Ay)(0 = Me)-PRM(A)(/i), 


for all C£ Mp{E). 

Proofs of the above can be found in (Barbour et al., 1992, Prop. 10.1.1), (Barbour et al., 1992, 
Lemma 10.2.3) and (Barbour et al., 1992, Prop. 10.1.2). We also remark that, by Proposition 2.3, 


|E(Ay)(H)| = |E/i(H)-PRM(A)(/r)| 



hdC{E) 



/idPRM(A) , 

(E) 


( 2 . 2 . 1 ) 


2.3 General results on approximation by Poisson processes 

We use an argument first made by Michel (1988) to establish a bound on the total variation distance 
between a point process and an "easier" Poisson random measure. 

Theorem 2.4. For each integer n > 1, let Ii,... ,In be Bernoulli random variables with probability of 
success P{Ii = T) = Pi £ (0,1). Let E be a locally compact separable metric space and let Xi ,..., A„ 
be i.i.d. E-valued random variables, independent of the Ifs. Moreover, let H = X]r=i dnd let W = 
i:Uh.Then, 

ciTv(/^(H),PRM(E“)) < dTv{C{W),F>oi{m¥)). 
is a finite measure over E with \{E) = A. 
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Proof. Let Zi,..., be i.i.d. random variables with distribution C{Xi), and let them be inde¬ 
pendent of W. Then the process has the same distribution as the process of interest H. 

Furthermore, note that a PRM(IEH) can be realised as ^Zj, where W* ~ Poi(IElL) is indepen¬ 
dent of the Zj's. Then the result follows from this representation of the two processes, using the 
law of total probability. □ 

With this theorem at hand, we are able to show 

Theorem 2.5. For each integer n > 1, Zet Xi,... ,X„ be i.i.d. copies of a d-dimensional random vector 
X with state space E C where d > 1. For a fixed set A e £, define Hyi := t^nd 

Wa-.= T.Uh^.eA}-Then, 

dTv{C{E a),PRM{EEa)) < dTv{C{W a),P oi{EW a)) < P(X G ,4). 

Proof. The first result is a direct application of Theorem 2.4. An application of Theorem 1 by 
Barbour and Hall (1984) yields the second inequality. □ 

The next two Propositions are giving useful bounds on the dTv and d 2 distances respectively 
between point measures. For the second one, we will briefly recall some notions and introduce 
references to understand the results. The first concept is that of Palm density, about which the 
reader will find a more thorough introduction in Kallenberg (1986, Chapter 10). Suppose H is a 
point process on E with cj-finite mean measure A. For any z £ E, a point process is called Palm 
process associated with Eat z if, for any measurable function f : E x Mp{E) -a- 


E [ f{z,E)E{dz) =E [ f{z,E,)X{dz) . (2.3.1) 

Je J Ue 

Intuitively, a Palm distribution of a point process S at a prescribed location 2 : is the distribution of 
H conditional on the presence of a point of H at 2 . Note that a process S is a Poisson process if and 
only if 

^(^z) = + dz) A-a.s. 

Finally, we recall the following useful results: 


Proposition 2.6. Barbour et al. (1992, p.235) Let X and X be two finite measures on E. Then 

drv (PRM(A),PRM(A)) < J |A-A|(cZ 2 ). 


Proposition 2.7. Brown and Xia (1995, Theorem 1.5) (version for Poisson random measures) Let A, A be 
two finite measures on E, and let A = X(E) = X{E). Then it holds that 

d 2 (PRM(A),PRM(A)) < (1 - e"^)di(A, A). 

Remark 2.8. We highlight that the pseudo-metric d[ used in Brown and Xia (1995) for the above 
result is equal to the cZi-metric in our setting where X(E) = X(E). 
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2.4 The bivariate Marshall-Olkin geometric distribution 


The bivariate Marshall-Olkin geometric distribution arises as a natural generalisation of the geo¬ 
metric distribution to two dimensions. It was first introduced by Hawkes (1972) and later studied 
by Marshall and Olkin (1985) as the discrete counterpart to their bivariate exponential distribu¬ 
tion, first derived by them in Marshall and Olkin (1967) using shock models. Limit distributions 
for maxima of i.i.d. Marshall-Olkin geometric random pairs were established in Mitov and Nadarajah 
(2005) and Feidt et al. (2010). 

Underlying the Marshall-Olkin geometric distribution are Bernoulli trials. Suppose S and T 
are two Bernoulli random variables with joint probability mass function P{S = i,T = j) = pij, 
for all i,j = 0,1, and let {Si, Ti), {S 2 , T2), ... be i.i.d. copies of (5, T). Let Xi and X 2 denote the 
numbers of Os before the first 1 in the sequences 5 i , ^2 ,... and Ti , T2 ,..., respectively. Obviously, 

Xi and X 2 follow geometric distributions with failure probabilities qi := P{S = 0) = poo + Pm 
and q 2 := P{T = 0) = poo + Pio, respectively. Their joint probability mass function is given by 


P{Xi=k,X2=l) 


Foo 92~^(1 - Poo/92 - 92 + Poo) 
' Poo(l - 91 - 92 +P 00 ) 

, Poo9i"'(l - 91 -Poo/91 +P 00 ) 


for k < I, 
for k = I, 
for k > I, 


(2.4.1) 


for any {k, 1 ) G where Z_|_ = {0, 1,2,.. .}. The distribution of X = (Xi, X 2 ) thus depends on 
three parameters: the two marginal failure probabilities qi and q 2 , as well as poo = P{S = 0,T = 
0), the probability of joint failure. We assume that poo > 9 i 92 - We have 


P{Xi >k,X2> 1) 


Poo92 "^ for /c < 

< pgo for k = l, 

, Poo9?"' for /o L 


(2.4.2) 


Note that if poo = 9 i 92 / the Marshall-Olkin geometric distribution corresponds to a bivariate dis¬ 
tribution with independent geometric margins. 


3 Rates of convergence for maxima of geometric random variables 

In this section we determine bounds on the error of the approximation, in the Kolmogorov dis¬ 
tance, of the laws of maxima of univariate geometric random variables and bivariate Marshall- 
Olkin geometric random pairs by appropriate limit distributions. The one-dimensional case is 
treated for example by Mitov and Nadarajah (2002) where they see that, for Xi,..., Xn i.i.d. geo¬ 
metric random variables with probability of success p = pn & (0,1), if p„ —> 0 as n —> 00 then, for 
all X G M, 

lim P 

n^ + CXD 


X{n) < 


log n + X 
Pn 


= e 


(3.0.3) 
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The following proposition investigates the rate of convergence of this limit result and suggests 
two improvements. One way to reduce the error is to approximate by a discretised version of the 
Gumbel distribution, the other is to use different normalising constants. 

Proposition 3.1. For each integer n > 1, let Xi,..., he i.i.d. geometric random variables with success 
probability pn G (0, 1), failure probability qn = 1 — Pnr probability mass function P{Xi = k) = Pnln 
survival function F{k) = Qn'^^/for any /c G Z_|_ = {0, 1,2,...}. Then: 


(a) (Approximation by a discretised Gumbel distribution) For all /c G Z+ and for all /c* G M defined by 
k* = - logn + klog{l/qn), 


P 



log n + k*\ 
\og{l/qn) J 



logn 

qnU 


H —• <5poiAppr- 
n 


(3.0.4) 


(b) (Approximation by a Gumbel distribution) For all x G M, 


P X(„. < 


log n + X 


log(l/g„ 


— e 


^ <5poiAppr Te log(l/(7n) —• (^Cont- 


(c) (Using the normalising constants from Mitov and Nadarajah (2002)) 


P 



log n + x\ 
1--qn ) 



< (Jcont + (log^ n + e ^) . 


Note that the failure probability qn need not vary with the sample size n for approximation by a 
discretised Gumbel distribution. The error bound is sharp for any constant qn ^ q ^ (0,1), show¬ 
ing clearly that it makes more sense to approximate a discrete distribution by another discrete 
distribution than by a continuous one, as there is no need to add an extra error as in (b). 

The extra error in (b), log(l/g„), is the discretisation error that arises when going from the 
Gumbel concentrated on the lattice of points k* to the continuous Gumbel distribution over M. 
It dominates the overall error in (b) unless qn tends to 1 fast enough as n —^ oo, that is, unless 
1 — qn = 0 (log(n) /n), in which case the discretisation error is of the same order as the first error 
term from (a). 

The upper bound in Part (c) shows that the choice of normalising constants or more precisely, 
of the scaling by pn in (3.0.3), may not be the optimal one. In order for the approximation in 
(c) to be good we require pn = o(l/ log^ n). Its being a stronger condition than the one for the 
asymptotic result from (3.0.3) is justified by (c) also being a stronger result in the sense that it 
gives a uniform bound. Note that the error in (c) is of fhe same order as the error in (a) only if 
1 - gn = 0 (l/(nlogn)). 

Proof. For ease of notation we omit the subscript n. 
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(a) Let A = [y, oo) for any choice of y > 0. Then P{Xi ^ A) = , and, setting k := [y] G Z+, 

Theorem 2.5 gives 

P(X(,) <fe)<(7^ (3.0.5) 

With k* G M chosen such that k = (log n + k *)/ log(l /q), we then have 


logn + fc^ 
log(l/g) 


— e 


< 


n 


(3.0.6) 


In order to find a uniform bound for all k G Z+, choose an auxiliary point defined as xq := 
xon '■= — log log n. We fhen study the error term in two cases: 

• k: k* > xq. We have exp (—A:*)/n < exp(—xo)/n = log(n)/n. 

• k: — log n < k* < xq. We have k* < m* < xq where 

m* := [(logn + xq)/ log(l/y)J log(l/g') - logn. 

Since distribution functions are non-decreasing and using (3.0.6), 

log n -I- A* \ / log n -I- m*' 


^ ^ log{l/q) 




< 


+ e 


n 


Therefore we obtain 


log n + k^ 
log(l/y) 


— e 


< 

- -6 e 

n 


e-*o 


< 

+ e“' 


qn 


log n 1 

- 1 —) 

qn n 


(3.0.7) 


where we used m* > xq — log(l/y) in the last inequality. 
(3.0.7) provides a bound for all k*, and thus 


logn + fc* 
log(l/y) 


— e 


log n 1 

<- 1 —. 

qn n 


(b) Let X = log(l/y)y — log n. By adding and subtracting exp {—e into (3.0.4), and noting that, 

since y < [y] = k, we have x < k* and exp{—e“^*} — exp{—e“*} > 0. We thus obtain 


P < 


log n -|- X \ 


log(l/y) ) 


— e 


^ <5poiAppr T 0 


— e 


where 


— e®"</ e*e®"di<e ^{k* — x) = e ^ log(l/( 7 )(|'y] — y) < e ^ log(l/y). 


-1 


.,-1 
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(c) From (3.0.6) and using x < k* as defined in (b) one has 


P < 


log n + x\ 


^ log(l/g) ) 


— e 


<-+ e-" -e-" <-+ e-Mog(l/g), (3.0.8) 


n 


n 


where we use the upper bound e ^ log(l/ q) derived in (b). Choose x' > — log n such that 

log n + X log n + x' 


y = 


log{l/q) l-q 
Then from (3.0.8) and observing that x > x' (since log(l/g) > 1 — q) 

log n + x'' 


P I ^(n) < 


1 - q 


— e 




<-he log(l/g) 


n 


By adding and subtracting e ® "" in the above expression we get 

log n + x' 


For the latter error term we find 


— e 


< ^ + e-' log(l/g) + 
n 


1 _ e-('^ " ")] < e-^' [l - 


< e ^'(x-x') = e [log(l/g) - (1 - g)] y, 

where we used exp{—e“^} < 1 in the first inequality and 1 — < z for z > 0 in both 

inequalities. Note that use of the definition of the logarithm and the geometric series give 


OO / -| \ '1 oo 

log(l/g) - (1 - g) = ^^ ^ 


(1 - qy 1 


i=2 
1 
2 


- -2 + q 
Q 


i=2 

2q 


Y^il-qy-l-(l-q) 

j=0 


Then, 


-e-^^ _e-^' (1 - -x' -x' ^ ^ ~ 1 / -x'^ , -In 

® - e ® < 2— e * = —:—(logn + x )e ^ < -(e ^ logn + e ^). 


2q 


2q 


2q 


Thus, 


P I ^(n) < 


log n + x’' 
1 - q 


^ - e ® + e ^ log(l/g) + logn + e . 


For a uniform bound over all x', we choose xq as before in (a) and (b), and obfain, with an 
analogous argument, the overall error bound 


logn 


l-q 


1 


+ e Mog(l/g') H—-—(log^n + e j +-■ 
n 2q ^ ' n 
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□ 


We now pass to the bivariate case where we assume that the random pair Z = {X, Y) follows the 
Marshall-Olkin geometric distribution as defined in Section 2.4, taking values {k, i) G Z^. We use 
the following normalisation: 


Z* = {X\Y*) = 


(log (—) 
V \PooJ 


X — log n , log 


(-) 

\PooJ 


Y — log n , 


(3.0.9) 


taking values {k*,£*) G [—logn, oo)^. Similarly to (a) in Proposition 3.1 above, the following 
proposition provides bounds on the error of fhe approximation, in the Kolmogorov distance, of 
the distribution of maxima of Marshall-Olkin geometric pairs by a discrete limit distribution. 

Proposition 3.2. For each integer n > 3, let Z^,..., Z* fee i.i.d. copies of the random pair Z* = (X*, Y*) 
as defined in (3.0.9). Moreover let = maxi<j<„X* and similarly for Y*^y Then, for all {k*, £*) G 
[— log n, oo)^, 

log miii{<;x,<;2} 


p (x*„| < r, y,;, < t) - H(k', t)\ < (i) 


log POO 


Poo 


where 


y) = { 


l°g(pon/'i' 2 ) logg^ 

_e log POO log POO , 

e X < y, 


i°g(poo/'3'i) log n 


log POO 


log POO ' 


x = y, 

X > y, 

We remark here that the limit distribution H is different from the ones obtained in Mitov and Nadaraj ah 
(2005), who used different normalisations. The idea underlying the proof of Proposition 3.2 is to 
apply Theorem 2.5 to A = [k, oo) x [£, oo) and Wa = follows: 

P{Wa = 0) - < P{Z G A). 

The limit distribution H defined in Proposition 3.2 thus corresponds to _ 

Proof. 1. If A:* = £*, we choose the auxiliary threshold xq := — log log n. For all k such that k* > xq, 
Theorem 2.5 gives 


P{Xl^<k\Yf^^<k*]-e-^ 






< 


<- 


< 


logn 


(3.0.10) 


n n n 

For k such that k* < xq, we may bound the error in (3.0.10) (i.e. the absolute value in (3.0.10)) 
by further adding the limit distribution to the error bound at m*, where m := [yoj and yo := 
(log n + xo)/ log(l/poo), i-e. 

=.-^0 logn 1 




< 


-9 e 


_g-m* ^ ^ 




n npoo 

where we used xq — log(l/poo) < rn* < xq in the second inequality. 


-|- 

npoo n 
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2. Suppose instead k* < I* (the case k* > I* is symmetric). By Theorem 2.5 we have 

1 l°g(P00/g2) j.* l°g 92 £* 


We first assume 

Choose the following auxiliary points: 

log Poo 


_ l°g(pnn/ 92 ) f.* _ l°g TO 
_e log POO log POO 


< — 0 log POO 

“ n 


Xo := 


Poo > 92 


, , logpoo , 1 

log log n, Po := - 7 n-log log n. 


'°SP00 . (3.0.11) 


(3.0.12) 


2log (poo/ 92 ) ’ 21 ogg 2 

Note that, under (3.0.12), xq < po- Hence: 

(a) If k* > Xo, £* > po, then 

1 i°g(poo/g 2 ) /.* log 92 />* loen 

_0 log POO log POO < 2 _ 

n ~ n 

(b) If r < xq, t < Po, then the error in (3.0.11) may be bounded by further adding the limit 
distribution to the error bound at (k*, A*), with 


K : = 


Xo + log n 
[log(l/poo) 

i.e. the error in (3.0.11) is bounded by 


A := 


Po + log n 

log(l/poo)J ’ 


1 _l£g(PooZM,,*_i£S^A* , _e- ‘°t?Poo'" ^ ~ 

_ 0 log POO log POO + e 

n 

1 _ iog(Poo/g 2 ) ^„_i£g^^,_lp^„,, _ logn 1 . 


< _0 log POO log POO +e 


n npoo n 

where we used xo — log(l/poo) < k* < xq and po — log(l/poo) < A* < po in the inequality. 

(c) ifr > Xo, i* < Po, then we may bound the error in (3.0.11) by further adding the limit 
distribution to the error bound at the point (A*, A*). This yields an error bound of 


1 _ log(PooM) A*-^A* , 1 - 


n 


0 log POO log POO + e 


= — e 
n 


A* I e 


log POO 


< i g-?;o+log(l/poo) _|_ 0-e no _ (log^T-)^*”*^*^^ _l_ g-(logn)2°log 92 ^ log^ _|_ ^-Vlogn^ 

“ " npoo ~ npoo 


n 


where we used po - log(l/poo) < A* < po, and \ < (logpoo) / (21ogg2) < 1 since ql < 
Poo < 92- 

(d) If k* < Xo, f’* > Po, the error bound is given by 


1 _ l°g(P 00 /-; 2 ) fc*_ log 12 p, 1 l°g(P 00 /l 2 ) '°S 12 J\o^n logjl 

— 0 log POO log POO < — e noo ^ log POO " = — < ■ ' 


n 


n 


log <12 — 

7 ^ log POO 


n 


since k* > — logn and ^ by (3.0.12). 
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Therefore, if (3.0.12) holds, an overall bound on the error in (3.0.11) is given by 

If rnsfead poo < Q 2 (hence xq > -ijq), we consider only the cases (a), (b), (d) as above (since we 

must have k* < i*). Hence 


i. If k* > xo, i* > Xq, fhen the error bound is given by 

1 l°g(P00/g2) J.* losn f * 1 l°g(P00/92) ^ log'72 


e ^0 ^ logn 


— e log POO log POO < — e logpoo ^ logpoo ” = 

n ~ n n 


< 


n 


Here we have used the fact that poo < Q 2 iiriphes 2 \og{poo/q 2 ) — 


ii. If k* < Xo, < Xo, fhen, choosing k := X := 
bounded by 

log(pnn/<; 2 ) * log go 


loRPOo < I 

, fhe error in (3.0.11) may be 


XQ+logn 

log(l/poo) 


1 log(P00/92) loggQ j.* _ 

— e log POO log POO + e ® 
n 

log POP 

Q -^0 _g-i^o (log n) 2 i°g(poo/'J 2 ) 
^ ~t~ e 


log POO log POO 


log POO 


n 
logn 


+ e 


+ e“2iog(poo/i2) < ^ 


npoo npoo npoo 

where we used xo — log(l/poo) < k* < xq in the first inequality and > poo in the second, 
iii. If k* < Xo, I* > Xo, as in (d) we obtain 

log 92 


1 iog(pnn/<; 2 ) j.* log 92 g* 1 iog(pnn/q' 2 ) ^ e '“spoo 

_ 0 log POO log POO < — e POO ^ log POO “ < - 

n -n - 


XQ 


^/logn 


log 92 


7j log POO n *°® POO 

Thus, if Poo < gl, an overall bound on the error in (3.0.11) is given by -h 

n log POO pqq 


□ 


4 Rates of convergence for MPPEs with geometric marks 

4.1 Univariate geometric marks 

In Proposition 3.1 we demonstrated that for maxima of geometric random variables the approxi¬ 
mation by a discretised Gumbel distribution living on lattice points k* gives a smaller error than 
the approximation by a continuous Gumbel distribution on M. For the latter approximation to 
be sharp, we need the condition that the failure probability Qn depends on n in such a way that 
1 — = 0 ( 1 /log n) for n —)■ 00 . We encounter a similar behaviour when approximating an 

MPPE with geometric marks, defined by Ea := ^{Xi&A}^Xi, by a Poisson process. The sef 

A G ^([0, 00 )^) will, in all further applications, be chosen such that points falling into A can be 
considered "extreme". We consider here the MPPE 

n 

:= (4-1.1) 

i=l 
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the normalised version of (4.1.1) where the marks are subject to the normalisation used in Propo¬ 
sition 3.1 (a). Proposition 4.1 below gives the error in total variation of the approximation of the 
law of by a Poisson process with mean measure living on the lattice E* of normalised points 
k*. On the other hand. Proposition 4.2 determines the error of the approximation by a Poisson 
process with an easy-to-use continuous mean measure, and uses the ^ 2 -metric to achieve this. 


Proposition 4.1. For each integer n> I, let Xi,..., Xn be i.i.d. geometric random variables with failure 
probability q G (0,1) (ind P{Xi > y) = q^y^,for any y > 0. Define the normalised random variables 
X* = \og{l/q)Xi — logn, i = 1,... ,n, taking values in E* = log(l/g)Z+ — logn. Let A* = [rt*,oo) 
for any choice ofu* G [— logn, oo), and let be defined as in (4.1.1). Then the mean measure ofEj^i, is 
given by 

7r*(B*) = (1 — q)e~^ , for any B* ^ B{[—logn, oo)), (4.1.2) 

k*&A*nE*nB* 

and 

_ 

drv {C(~%),FLm(7r-)) <- -. 

n 

Proof. For all k G Z+, we use the normalisation k = (k* + log n)/ log(l/ q), where k* G E*. We then 
have P{Xi = k) = (1 — q)q^ = (1 — q) = P{X^ = k*), and, for any B* G B([— log n, oo)), 

7r*(B*) =nP(X* Gyl*nS*) = nP{Xl = k*)= Y (4-1.3) 

fc*eA*nE*nB* A:*eA*nE*nB* 


Using Theorem 2.5, we obtain 

(iry(/l(Z:^*),PRM(7r*)) < P{Xl >u*) = P{Xi> 


^' + log^ V 

log(l/g) 


< 


n 


□ 


The following proposition now uses the (i 2 -metric to approximate the MPPE with geometric 
marks by a Poisson process with continuous intensity, as the total variation metric is too strong 
to achieve this. The continuous intensity measure we aim for is the same as that of an MPPE 
with exponential marks. The result is achieved in two steps: we first estimate the error in the d 2 - 
distance of the approximation by a Poisson process with mean measure given by (4.1.2), and then 
compare this Poisson process by another one with the desired continuous mean measure, again 
in the (i 2 -distance, by making use of Proposition 2.7. We assume here that do is the Euclidean 
distance on M bounded by 1, i.e. ^ 0 ( 2 ^ 1 , 22 ) = mind^i — 2 ; 2 |, 1) for any zi,Z 2 G M, and define the di- 
and d 2 -distances as in (2.1.1) and (2.1.3), respectively, in Subsection 2.1. 

Proposition 4.2. For each integer n > 1, let Xi, Xf, i = 1,..., n, and E* be defined as in Proposition 
4.1. Let A* = [u*, 00 ) for any choice ofu* G E*, let he defined as in (4.1.1) with mean measure tt* 

as in (4.1.2), and define the continuous measure X*{B*) = e~^dx,for any B* G B{[— log n, 00 )). 

Then 

_ 

d 2 (£(S^*),PRM(A*)) < ^-hmin{log(l/g), 1}. 

n 
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Proof. We have 

d2 {C{E\.),PRM{X*)) < d2 (£(H^*),PRM(7r*)) + d2 (PRM(7r*), PRM(A*)), 
where, by Proposition 4.1, 

ci2(£(H^*),PRM(7r*) < dry (/^(H:^*),PRM(7r*)) < ® 


n 


It thus remains to determine an estimate of d 2 (PRM(7r*), PRM(A*)). Since A*(yl*) = e *dx = 
e““* = 7r*(,4*), Proposition 2.7 gives 

d2(PRM(7r*),PRM(A*)) < (^1 - di(7r*, A*) < (ii(7r*, A*). (4.1.4) 

By Definition (2.1.1) of the di-distance, 

/ CO 1*00 

K(x)7r*(dx) — / K(x)A*(dx) . (4.1.5) 

- log n J — log n 

We may write the two integrals in the above expression as a sum of integrals over the "normalised 
unit intervals" [k*, {k + 1)*) = [k*,k* -|- \og{l/q)), for all k* C E* D [n*,oo). The modulus then 
equals 


E l ^fc*+log(l/q) i‘K'+log(L/q) 

< / K(x)7r*(dx) — / K(x)A*(dx) 

k*>u* 


A:*+log(l/(j) 


(4.1.6) 


Since tt* is concentrated on the lattice points /c* G i?* n [u*, oo), we have 
ffc*+log(l/<?) 

/ K(x)7r*(dx) = K{k*)7T* {{k*}) = K{k*){l — q)e 


-k* 


Note that we obtain the same result by computing 

ffc*+log(l/(?) 

/ K(A:*)A*(dx) = K{k*) 

Jk* 

We may thus express (4.1.6) as follows: 

fA:*+log(l/g) 


ffc*+log(l/ij) 


e *dx = k(/c*)(1 — q)e 


-k* 


E 

k*>u* 


{K{k*) — «:(x)} A*(dx) 


s E 

k*'>u* 


f>A:*+log(l/q) 


|k(A:*) — k(x)| A*(dx). 


From Lipschitz continuity of k, we know that \K{k*) — «:(x)| < si{K)do{k*,x) for any x G [k*, k* -|- 
log(l/( 7 )), where k* C E* n [u*, oo). The maximum Euclidean disfance between k* and any point 
in [k*, k* -|- log(l/g)) is of course given by log(l/( 7 ). Since we bound do by 1/ we have 

\K{k*) — k{x)\ < si(k) min {log(l/g), 1} . 

For the di-distance in (4.1.5) we now find, using A*([tt*, oo)) = e““*. 


di(7r*,A*) < ^ 


r'fc*+log(l/<?) 


min{log(l/g), l}\*{dy) = min{log(l/g), 1} , 


k*'>u* 


which we plug into (4.1.4) to obtain an estimate for d2(PRM(7r*), PRM(A*)). 


□ 
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The approximation of by PRM(A*), whose continuous intensity function corre¬ 

sponds to that of MPPEs with exponential marks, gives rise to an additional error term which 
depends only on the failure probability of fhe geometric distribution. The error will become small 
only if we allow the failure probability q = qn \-0 tend to 1 as n —)■ oo. Since \og{l/qn) is the length 
of the normalised unit intervals, this condition causes the lattice structure to melt into the whole 
real subset [— log n, oo) as n —)■ oo. Note that Proposition 4.2 does not require to vary at a par¬ 
ticular rate. The reason for that is that we chose the threshold n* as element of the lattice E*. If we 
had not done so, we would have obtained an additional error term of size log(l/( 7 n)e~“™. In this 
case, qn would have needed to vary at a fast enough rate to guarantee a small error despite the 
factor which roughly corresponds to the expected number of exceedances and should thus 
be greater than 1. We refer to Section 4.2.4, where we established the error estimate in full defail 
for MPPEs with bivariate geometric marks. 

4.2 Bivariate Marshall-Olkin geometric marks 

4.2.1 Approximation in dry by a Poisson process on a lattice 

For any integer n > 1, let Xi,..., X„ be i.i.d. copies of the random pair X = (Xi, X 2 ), which 
follows the Marshall-Olkin geometric distribution from Section 2.4. We use fhe following normal¬ 
isation for sfudying joint threshold exceedances, introduced in Section 3 above: 

= {klog{l/poo) -logn, nog(l/poo) -logn), for any (/c,/) G (4.2.1) 

and denote by E* the lattice of normalised points {k*,l*). The following proposition gives straight¬ 
forward error estimates for the approximation of the law of by that of a Poisson process with 
mean measure ES^*, both for general sets A*, and for the particular choice A* = [u*,oo)^ (for 
which captures joint threshold exceedances of the components of fhe normalised random 
pairs Xf,...,X*). 

Proposition 4.3. Suppose X = {Xi, X 2 ) follows the Marshall-Olkin geometric distribution with parame¬ 
ters ( 71 , ( 72 , Poo £ (0) 1). For each integer n > l,letX\,..., X* be i.i.d. copies of the normalised random pair 
X* = {X^,Xf) with state space E*, where Xj = log{l/pQo)Xj— logn, for j = 1,2. Let A* G .B([0,oo)^) 
and let and IP^* be defined as follows: 

n n 

‘=‘A* = iind WX* = 

i=l i=l 

Then the mean measure of E^* is given by 

7r*iB*) -.= 7r%iB*) := EE^*(il’^) = ^ nP(X,* = kfX^ = E), 

(k* ,i*)&A*nE*nB* 
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for any B* G B{[— logn, oo)^), where, for any (A:*, I*) G E*, 


P{Xl = k\X^ = E) 

i°s(pnn/92) I.* log 12 I* 

i (1 - ^ - g2 + Poo) e '°SJ’00 e '°g^>oo 
= ' ^ (1 - - ^2+Poo)e"^* 

log 91 u* log(P00/gl) I* 

i (1 - gi - ^ +Poo) e '°spoo e '°spoo 


for k* < I*, 
for k* = I*, 
for k* > I*, 


(4.2.2) 


and d-Tv {B ,PRM{7 v*)) < P(X* G A*). With A* = ^* = [u^, oo)'^ for any choice of > 
— log n, we obtain 

6 

dTV {C {E%), PRM(7r*)) < -. (4.2.3) 

n 

Proof With (2.4.1) and (4.2.1), we obtain (4.2.2). For any B* G B{[— log n, oo)^), the mean measure 
of applied to B* is given by 


nP(X* e A* nB*)= nP{X* = k*,X* = I*). 

(k* ,i*)&A*nE*nB* 

By Theorem 2.5, drv (B (H^*) , PRM(7r*)) < P(X* £ A*), where, using (2.4.2), we find 


P (X* G A*) = P Xi > 


tt* + log n 

log(l/poo) 


,X2 > 


ii* + log n 
log(l/poo) 


n*+log n ~| 

_ „ log(l/P00) ® 


= Poo 


n 


(4.2.4) 

□ 


4.2.2 Construction of a "continuous" intensity function 

Proposition 4.3 gives an error bound for the approximation of the MPPE by a Poisson process 
whose mean measure EH^* lives on the lattice of normalised points {k*, I*), i.e. on 

B* = (log(l/poo)^+ - logn)^ C [-logn,oo)^. 

We would however prefer to approximate the law of the MPPE by that of a Poisson process with an 
easier-to-use and more flexible continuous intensity measure A* = A^* living on yl* n [— log n, oo)^. 

As the survival copula of the Marshall-Olkin geometric distribution is a Marshall-Olkin cop¬ 
ula, and thereby consists of both an absolutely continuous part off fhe diagonal in [— log n, oo)^ 
and a singular part on the diagonal (refer to Section 3.1.1 in Nelsen (2006)), the "continuous" 
intensity measure A* will have to be of the form 

X*{B*)= [ X*{s,t)dsdt+ [ X*{s)ddts, (4.2.5) 

J A*nB* J A*nB*n{(s,t):s=t} 

for any B* G R([—log n, oo)^), for "continuous" intensity functions A* and A* that, if integrated 
over the entire space, will give n. 
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Remark 4.4. Note that for simplicity of language we here (and later on) somewhat abuse terminol¬ 
ogy when speaking of a "continuous" intensity function A* or a "continuous" intensity measure 
A*. The bivariate intensity function A* is not continuous, but piecewise continuous, having a jump 
along the diagonal. The measure A* is continuous only in the sense that it has an intensity with re¬ 
spect to Lebesgue measure (2-dimensional on the off-diagonal and 1-dimensional on the diagonal) 
and not with respect to a point measure. 

The idea is to spread the point mass sitting on each of the off-diagonal lattice points {k*, I*) G 
E*, k* / I*, uniformly over each of their corresponding coordinate rectangles (or rather, coordi¬ 
nate squares) 

= t,r + iogf—)) X [r,r + iogf—"jV fevr, 

L \PooJJ I \PooJJ 

and to also spread the point probabilities of fhe diagonal points {k*, k*) over the diagonal line 
s = t, where s,t > — log n. We achieve this in the following three steps, illustrated in Figure 1. 

Step 1. Consider only the off-diagonal lattice points. We of course have 

P (X* G = P {XI = k\X^ = P), 

which is given by (4.2.2), and we may express the mean nP{X* G A*) as 


E 

{k* 


nP{X*,=k\X^ = k*), (4.2.6) 


where log^(l/poo) is the surface area of As we aim to find a continuous intensity function 

over the entire space [— log n, oo)^, we exchange k* and I* in the expression of the point probability 
P{X^ = k*,X 2 = P) from (4.2.2) by s and t, respectively. E.g., suppose that k* < P. Then we 
replace the integral in (4.2.6) by 

r f 1 - Poofe ->!. + POO •e-a&‘d.dt = ^ ~ ~P(xr ^ r. x; = r 

J log^(l/poo) Iog{poo/q 2 )logq 2 

The switch to variable s and t thus results only in the multiplication of the original point proba¬ 
bility by a factor. The goal, however, is to integrate a function in s and t over and obtain 

the original point probability. This may be achieved by simply dividing the integrand by the 
multiplying factor . Hence, we rewrite the mean as follows 

j j X{s,t)dsdt + n Y = k*,X* = k*), 


{k* 


(k*,k*)eA* 


where 


Iog2(l/poo) 


(4.2.7) 
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Figure 1: Three steps to construct a "continuous" intensity function. 


Analogously, we find 

Iog2(l/poo) 


(4.2.8) 


(4.2.7) and (4.2.8) supply suitable choices for the intensity function on coordinate rectangles lying 
above and below the diagonal, respectively. 

Step 2. We expand X*{s,t) from (4.2.7) and (4.2.8) to the entire space (without the diagonal), 
i.e. we define 

l°g(poo/<? 2 )logg 2 * 


X*{s,t) := 


log (1 /poo) 


log POO e lospoo for s < t, 


log{poo/qi)logqi ^ 


log (1/poo) 


for s > t, 


for all (s, t) G [— log n, oo)^ . However, this adds surplus mass on the diagonal rectangles Rl^, 

Step 3. We adjust for the surplus mass on the diagonal rectangles by subtracting it from the 
point probabilities of the diagonal lattice points {k*,k*), and accordingly rewrite the mean as 
follows: 


[ fXn{s,t)dsdt + n ^ Ip{X 

{k*,k*)eA* { 


^ = k*,X^ = k*)-- 


1 

njR 


Xt,{s,t)dsdt} . (4.2.9) 


■k*,k* 


Computation of the term in curly brackets shows that the new mass that we put on the diagonal 
segments of each diagonal rectangle is given by 


n 


(1 - Poo) 


log(l/poo) 


- 1 


(4.2.10) 
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Note that this equals 


fA:*+log(l/poo) g-s r 


n 


log(l/gig 2 ) 

log(l/poo) 


- 1 


ds, 


for each k* G E*, where we have parameterised the intensity function on the diagonal as projection 
along the s-axis. We thus define: 


A*(s,f) = { 


iog(PooN2)iog.2 ^ 

log"(l/poo) 

log(poo/qi)logqi * 


log (1/poo) 


for s < f, 
for s > t, 


(4.2.11) 


A*(s) = 


log(poo/^N2) 


log(l/poo) 

The above construction guarantees the following: 


for s = t. 


Proposition 4.5. Let A*, A* and A* he defined by (4.2.5) and (4.2.11). Then, 

(i) A* = TT* {Rl.^i.),forany (k\E) G E\ 

/ poo 

X*(s,t)dsdt+ / A*(s)ds = n. 

-logn,cxD)2 J—\ogn 

□ 


Remark 4.6. It can readily be shown that the new intensity functions A* and A* may be expressed 
in the original coordinate system by 


X(x,y) = Xnix,y) = { 


nlog(g 2 )log 

nlog(gi)log(^) qrVoo 


91 


A(x) = Xn{x) = nlog 


Poo 

QiQ2 


POQ for x = y, 


for any (x,y) G [0, oo)^. We recognise a weighted and continuous 
from (2.4.2). 


for X < y, 
for X > y, 


version of P(Xi > k,X 2 > 1) 


4.2.3 Assumptions on the distributional parameters 

The continuous intensity measure A* defined by (4.2.5) and (4.2.11) depends on the parameters qi, 
q 2 and poo of the Marshall-Olkin geometric distribution. Our aim is to determine a bound on the 
error for the approximation of the Poisson process with mean measure living on the lattice 

E*, by a Poisson process with mean measure A*. As Section 4.2.4 will show, the probability of 
simultaneous success, pn, for the Marshall-Olkin geometric distribution, will have to tend to 0 as 
n —)■ oo. Since poo + Poi + Pio + Pii = 1/ this of course influences the distributional parameters 
Poo, gi and q 2 in that it also makes them dependent on n. The continuous intensity functions A* 
and A* thus have the drawback that, through their dependence on the parameters poo, gi and q 2 , 
they are also dependent on n. We thus try to find other suitable continuous intensity functions 
that no longer vary with the sample size. 
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For simplicity, we make the assumption that pio and poi vary at the same rate as pu = pun', 
more precisely, assume pio = pion = iPiin and poi = Pmn = ^Piin, where 7 and 5 are strictly 
positive real numbers, bounded such that pio and poi are smaller than 1. We assume that pun tends 
to 0 as n 00 at a rate that will be determined later, and express the distributional parameters as 
functions of it: 


qin = l- (1 +l)piln, 

g' 2 n = 1 - (1 + 5)Flln, (4.2.12) 

POOn = 1 - (1 + 7 + S)pun. 


Plugging into (4.2.11) and using the relation log(l — z) ~ —z for \z\ < 
A*(s, t) and A*(s) are, for > 0 as n —)• 00 , asymptotically equal to 


and A* 5 ( 5 ) := 


1 + 7 + (5 


e ® for s 


1 and z —)■ 0 , we find that 


s < t, 
s > t, 


(4.2.13) 


respectively, for all (s, t) G [— log n, 00 )^. At first glance A* ^ and A* ^ seem to be valid choices for 
continuous intensity functions independent of n. We will investigate in Section 4.2.5 whether a 
Poisson process with mean measure A* on A* may indeed be approximated by a Poisson process 
with mean measure 


[ [ \; 4 s,t)dsdt+[ x;,si^)ds, ( 4 . 2 . 14 ) 

J J A*r\B* J A*nB*n{{s,t): s=t} 

for all B* G B{[— logn, oo)^). To do the corresponding error calculations for a fixed sample size 
n we first need to examine in further detail the differences between the exponent terms in A*(s, t) 
and A*^^(s,f): 


Lemma 4.7. For each integer n > 1, let pun £ (0,1) and let gi„, q 2 n, Poon G (0,1) be defined by (4.2.12). 
Then, 


Moreover, 


and 


1 + _ log g 2 n ^ IPlln 

1 + 7 + (5 logpoon “ 1 - (1 + 7 + 6)piin ’ 

1 + 7 _ log gin ^ _ dpiin _ 

1 + 7 + (5 logpoon “ 1 - (1 + 7 + 5)piin ' 


[in) 

0 < log 1 

' q2n 

\P00n 

(iv) 

0 < log 1 

^ qin 
\P00n 


< 


IPllr. 


< 


log 


POOn 


log 


POOn 


1 - (1 + 7 + 6)pun ’ 

_ Spiln _ 

1 - (1 + 7 + 5)pun ’ 

(1 + 7 + (i)piln 


< 


qinq2nj {1 - (1 + 7 + (5)piln}^ 
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Proof, (i) For ease of notation we omit the subscript n. Since, for all | 2 :| < 1, — log(l — z)lz is 
increasing and —{l — z) \og{l — z)/z is decreasing, we obtain the following lower and upper bound, 
respectively, for -(logg 2 )/(logpoo)/ where q 2 < poo- 

1 + <^ ^ logg 2 ^ (1 + (5) • [1 - (1 + 7 + 5)pii] 

1 + 7 + “ logpoo “ [1 - (1 + <J)Pii] • (1 + 7 + <J)' 

Therefore, 

Q ^ 1 + _ log g2 ^ l + S (_ 1 - (1 + 7 + (5)pii ) 

“1 + 7 +(5 logpoo “ l + 7 + <J I l-(l + (5)pii J 

__ (1 + 5)'ypii _^_TPii_ 

(1 + 7 + (5)[1 - (1 + 5)pii] “ 1 - (1 + 7 + 5)pii ■ 


(iii) Moreover, since q 2 = poo + Pio, we have log{q 2 /poo) > 0. Using log(l + z) < z for positive z, 
we obtain 


log(g2/P00) = log 


Poo + PlO 


< 


PlO 


7Pii 


Poo / Poo 1-(1 + 7 + <5)pii 

(ii) and (iv) can be shown analogously to (i) and (iii), respectively. 

(v) We have 


log ( ^—) log ( j = (_iogpgp) {_iog(pQo +poi) - log(poo +Pio) +logpoo} 

\P00n/ \qinQ2nJ 


< (- log Poo) {- log Poo - log Poo + log Poo} = (- logpoo)^ < 
(1 + 7 + S)pu 


(1 -poo)^ ^ 1 -POO 


i’oo 


i’oo 


{1 - (1 + 7 + d)piiy 


□ 


We will use Lemma 4.7 to determine error estimates in Sections 4.2.4 and 4.2.5. 

Remark 4.8. We suppose here that 7 and 5 do not vary with n. However, the asymptotic equiva¬ 
lence of (4.2.11) and (4.2.13), and later results (i.e. Propositions 4.10 and 4.12, as well as Corollary 
4.13) also hold for the case 7 = 7 n and 5 = Sn- These results are thus actually stronger than we 
make them out to be. 

4.2.4 Approximation in d 2 by a Poisson process with continuous intensity 

We now determine the error of the approximation of the Poisson process with mean measure tt*, 
living on lattice points {k*, P) G A* n E* , and the Poisson process with continuous mean measure 
A*, living on A* n [— logn, 00 )^. Note that any not too small set A* G B{[— logn, 00 )^) contains 
subsets that are unions of coordinate rectangles Ry j*, i.e. of the form 

U ^ (4-2-15) 
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Figure 2: Examples of sets A*. 


where M* is a countable subset of E*. Let A* denote the biggest subset of A* of the form (4.2.15). 
In order to prove Theorem 4.9, we rely on arguments used by Brown and Xia (1995) used to show 
Proposition 2.7. More precisely, let y be as defined in Proposition 2.1 for h gT-L. 


Theorem 4.9. With the notations from Sections 2.4-42.3, we obtain, for a set A* G B{[— log n, oo)^). 


d2(PRM(7r*),PRM(A*)) < \/21og(l/poo) + min 


^ 1.65 


X*{A*\A*), 


(4.2.16) 


where A* denotes the biggest subset of A* of the form (4.2.15) with M* the biggest subset of E* such that 
A* C A*. 


Proof. Let H^r* ~ PRM(7r*) and Ex* ~ PRM(A*). We recall the standard techniques also employed 
in Brown and Xia (1995). Suppose that Z = {Zt,t G M+} is an immigration-death process on A* 
with immigration intensity A*, unit per-capita death rate, equilibrium distribution £(Ex*), and 
generator A. Furthermore, let PL denote the set of functions h : Mp{A*) —)■ M such that (2.1.2) 
is satisfied and let y : Mp{A*) —> M be defined by y(^) = — J^{E^h{Zt) — PRM(A*)}df, for any 
^ G Mp{A*). By Proposition 2.1, y is well-defined, and by (2.2.1), |PRM(7r*)(/i) — PRM(A*)(/i)| 
equals |E(yly)(H 7 r*)|. Proceeding as in the proof of Theorem 1.2 in Brown and Xia (1995), we find 


E(yly)(H^*) = E / [y(H^* -6 dz) - y(S^*)] (X*(dz) - 7v*{dz)), 

Ja* 


and thus 

|E(My)(H^*)l 


< 


1 


S2{h) S2{h) 


E 


-6 


1 


S2ih) 


[ [y(H^* -6 (5z) - y(H 7 r*)] {X*{dz) - Tv*{dz)) 

J A* 

[ , [y(S,r* +(^z) -y(S,r*)] ('^*W -7r*((iz)) 

J A*-^A* 


(4.2.17) 
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Again by the proof of Theorem 1.2 in Brown and Xia (1995), the first summand is bounded by 


1 — e 


-A{A*) 


di(7r*,A*))U-*+ A*(A*)-7r*(A*) 


min < 1, 


1.65 


A*(A*i 


<di(7r*,A*))U-*, 


where di{., .)|^* denotes the di-distance on A* (instead of on A*) and where we used A*(A*) = 
7 r*(A*) > 0 for the last inequality. The di-distance between A* and tt* on A* is given by 

I/d* - I A* 


di(7r*,A*)U* = 


sup 


A*(A*) K&k si{k) 

As A* is a union of coordinate rectangles we may express kcItt* — kcIX* as 


E 


>R* 


(k*,l*)eA* '' 

Furthermore, again by Proposition 4.5, 


(z)7r*(dz) — f K{z)X*{dz) > . 


(4.2.18) 


K{z)7V*{dz) = K{{k*,l*))7V*{Rl.i.) = K{{k*,l*))X*{Rl*^i*). 


k* ,1* 


Hence, we find the following upper bound for (4.2.18): 

y/ [ \K{{k*,l*)) - k{z)\ X*{dz) < si{K)do{{k*,R),z)X*{A*), 

where we also used fhe definition of the Lipschitz constant si{k). Since the biggest possible Eu¬ 
clidean distance between the lower left corner point {k*,l*) and any other point z in the rectangle 
;* is given by the length y/2 log(l/poo) of its diagonal, we have 

di(7r*,A*)U* < x/21og(l/poo). (4.2.19) 


For the second summand in (4.2.17), we note that, for any ^ E Mp{A*)^ 




[y(C + <Jz) - y(0](A*(dz) - 7r*(dz)) 


< 


[ |y(C + <Jz)-y(OI • 1-^*^-7r*(dz)| 

4A*\d* 


< Aiy / \X*{dz) — 7v*{dz)\ < Aiy • A*(A* \ A*) 

Ja*^A* 


and, completing the proof, that Lemma 2.2 gives 


(4.2.20) 


Aiy < S 2 {h) min < 1, 


1.65 




(4.2.21) 

□ 
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log(l/poo) 



Figure 3: The set yl* \ 


Theorem 4.9 gives sharp results only if the probability of simultaneous failure, poo = Poon tends 
to 1 as n ^ oo. This makes sense since log(l/poo)/ introduced as scaling factor of the original 
marginal geometric random variables, provides the side lengths of the rescaled lattice squares. 
The condition poon t 1 makes the side lengths of fhe coordinate squares tend to 0 and thus causes 
the "disappearance" of the lattice into the whole real subset [— log n, oo)^. The same holds for the 
area A* \ A* , thereby also causing the disappearance of fhe second error term as n —)• oo. 

For sets A* that are unions of coordinate rectangles, there is no left-over area A* \ A* , and by 
consequence no second error term. We now apply Theorem 4.9 to the case where A* = A^ = 
[u*, oo)^ and express the error estimate in terms of the threshold rt* and the probability of simul¬ 
taneous success Pi In- To achieve this we assume that the distributional parameters poo, qi and q2 
are defined as in Section 4.2.3. 


Proposition 4.10. Let pun G (0,1) and assume that qin, q 2 n poon satisfy (4.2.12). For any choice of 
— log n, define A* = [rt*, oo)^. With the notations from Theorem 4.9, 

d2(PRM(7r*),PRM(A*)) < .^ (l+7 + %iin + Smin |e-<, 1.65e-</2U , 

[1 - [l+J + d)punr ^ ^ 

Proof. For ease of notation we omit the subscript n. We apply result (4.2.16) from Theorem 4.9 fo 
the special case A* = [w*,oo)^. Due to (4.2.12) and — log(l — z) < z/{l — z) for \z\ < 1, we may 
bound the first of fhe two error terms in (4.2.16) as follows; 


^log(l/poo) < 


V2{1 + y + 6)pn 
1 - (1 -6 7 -6 S)pn 


^ \/2(l + y + 6)pn 

~ [1 - (1 -6 7 -6 ( i ) pii ]2 ' 


(4.2.22) 


Direct computation yields A*(^*) = e “*. 


As illustrated by Figure 3, A*(A* \ A*) may be bounded 
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by 


log(l/poo) sup X*{s,t)dt 

se[u*,k*] 


poo pk* 

+ / log(l/poo) sup X*{s,t)ds+ / \/21og(l/poo) sup A*(s)ds. 

J u* t£[u* ,k*] Ju* sG\u* .k*] 




Note that 


sup exp 

5 E[w*,/c*] 

sup exp 

tG[u* ,k*] 


log(P00/g2) 

log Poo 

log(Poo/gi) 
log Poo 


s > < exp < — 


■ u 


l og(POo/g2 ) 
log Poo 


, ^ , log(poo/gi) , 

t>< exp <-^- u 

log Poo 


and supgg[„* ^*] e ® < e “*. Thus, by definition (4.2.11) of X*{s,t), 

^ . Iog(poo/g2)logg2 

/ log(l/poo) sup A(s,f)df< -^—--—r-e '°spoo 

Ju* seK,fc*] log(l/poo) 

which equals log(( 72 /poo)e~“*. Analogously, 


log 12 


e '°gpoo df, 


log(l/poo) sup A*(s,f)ds < log(gi/poo)e “ , 


whereas 

I \/21og(l/poo) sup A*(s)ds < 21og^(l/poo) e~^*, 

Ju* se[u*,k*] log (1/Poo) 

since k* — u* < \/21og(l/poo)- We obtain 


A*(A* \ A*) = e <i log ( — ) + log ( — ) + 2 log 

Poo > 1 


gi 
VPoo 


(—) log( 
VPoo/ V 


Poo 


By Lemma 4.7 (iii)-(v), the term in curly brackets may be bounded by 

3(1 + 7 + 5)pii 


(7 + <^)Pii 2(1 + 7 + (5)pii 


< 


1 - (1 + 7 + (5)pil {1 _ (1 -I-^ {1 _ (^1 ^ 

An upper bound for the second error term in (4.2.16) is thus given by 

min L-<, 1 . 65 e-</d 3(l+7 + %im 

1 / [1 - (1 + 7 + 5)piln]2 

By adding this to the bound in (4.2.22) we obtain the result. 


□ 


The first of the error terms given by Proposition 4.10, i.e. \/ 2 (l+ 7 +( 5 )piin / [1 —(l+ 7 +( 5 )piin]^, is a 
bound on the error \/2 log(l/poon) from Theorem 4.9, where we used the assumption from Section 
4.2.3 that poon = 1 — (1 + 7 + (5)piin- This error term thus becomes small only if the probability of 
simultaneous success, pun, tends to 0 as n increases. The second error term, i.e. 


{ 


1.65e"</2| 


3(1 + 7 + 5)pun 
[1 - (1 + 7 + 5)punV ’ 
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is the bigger of the two, and determines the rate at which pun must converge to 0. The reason for 
that is that pun must converge fast enough in order to offset the effect of the factor which 
we will want to be increasing with increasing n, since = A*(yl*) is the expected number of 
points in ^4* of the approximating Poisson process, as well as more or less the expected number 
of threshold exceedances of the MPPE, for which we have e~^”/poon < 7r*(d.*) < For 

instance, for a threshold of size — log log n, the expected number of points in A* of the two 
Poisson processes is log n, the MPPE captures roughly the biggest log n points of its sample, and 
we need pun = o(log~^ n) for a sharp error bound. Suppose, for example, that pii„ = n~^. Then, 
by (4.2.12), the marginal probabilities of failure of X*, qin and q 2 n, as well as the probability of 
simultaneous failure, poon/ tend to 1 very fast. 

The mean measure A* is by definition dependent on the values of the distributional param¬ 
eters. Since these need to vary with the sample size n in order to obtain a small error for the 
approximation of PRM(7r*) by PRM(A*), it follows that A* = A* (and of course also tt* = tt*). 
Though we have now achieved the goal of successfully approximating by a Poisson process with 
a continuous intensity, the conditions needed to accomplish this imply that we are not satisfied 
with our results yet, since we prefer to approximate by a Poisson process with continuous inten¬ 
sity that does not vary with n. As the next section will demonstrate, a suitable candidate is given 
by the Poisson process with intensity measure A* ^ defined in (4.2.14). 

4.2.5 Approximation in d 2 and d-Tv by a Poisson process independent of n 

We determine an error estimate for the approximation of the Poisson process with intensity mea¬ 
sure A* = A* by the Poisson process with intensity measure A* 5, defined in (4.2.14), that does 
not depend on the sample size n. Since both intensities are continuous, there is no special need 
to use the (i2-distance. We give the error in both the total variation and the d 2 distances. For the 
error in total variation we may straightforwardly use Proposition 2.6 for the approximation of two 
Poisson processes. For the d2-error, which will be smaller than the dxv, we may additionally use 
Femma 2.2 for an upper bound on Aiy, where y is fhe solution of an adequate Stein equation. 

This bound, containing the factor A*(A*)“^/^ (or A* ^(A*)“^/^), serves in reducing the (i2-error. 

Theorem 4.11. With the notations from Sections 2.4 and 4.2.1-4.2.3, we obtain, for any set A* e B{[— log n, 00)^), 

(i) dTv {PRM{X-),PRM{\;s)) < [ \x*{dz) - Xfs{dz)\, 

J yl* 

{ii) d2 (prm(a*),prm(a;^ 5)) 

< minjl, 1.65min |a*(A*)“R^ , A* 5(A*)“R^|| J |A*(dz) - Xfs{dz)\. 

Proof, (i) By Proposition 4.5 (ii). A* is finite. Moreover, A* 5 is finite since integration of A* ^ and 
A* 5 over [u*, 00)^ gives e““* which equals n for u* = — log n. Proposition 2.6 then immediately 
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gives the result. 

(ii) Using the same immigration-death process Z and arguments as in the proof of Theorem 4.9, 
we can show that for ^ ~ PRM(A* ^), 

- PRM(A*)(/i) = + 4) - y{Elsm^{dz) - A;,(dz))| . 

Analogously to (4.2.20) and (4.2.21), the integrand may be bounded by 

- A*, 5 (dz)| < S 2 (/i)min|l, | A*(dz) - A* , 5 (dz)|. 

Here, 1.65(A*(A*))“^/^ may be replaced by 1.65(A* ^(A*))“^/^ by going through the same argu¬ 
ments as before, but instead starting with an immigration-death process over A* with immigration 
intensity A* unit per-capita death rate, and equilibrium distribution PRM(A* 5 ). □ 


We now again assume that the distributional parameters poon, Qin arid q 2 n satisfy (4.2.12) and 
apply Theorem 4.11 to the case where A* = A* = [u*, 00 )^. We express the error bounds in terms 
of the threshold u* and of the probability of simultaneous success pun- 


Proposition 4.12. Let pun £ (0,1) and assume that qin, q 2 n poon satisfy (4.2.12). For any choice of 
u* > — log n, define A* = [u*, 00 )^. With the notations from Sections 2.4-4.2.3, 

(') {PRM(A*),prm(a;,)) < <=-"= ■ 

(ii) d 2 (PRM(A*), PRM(A; 5 )) < min {e-<, 1.65e-</2} • 

Proof. For ease of notation we again omit the subscript n. 

(i) By Theorem 4.11, 


dry (PRM(A*),PRM(A;5)) < 


X*{s,t) - A*^5(s,f)| dsdf 




poo 

|A*(s,f) — A* 5(s,f)| dfds-|- / A*(s) — A* ,5(5) 

J U* 


ds. 


Define 


h := h(pii) 


1 + 5 log ^2 , , X 1 + 7 log 

-and q := qipii ) :=-. 

l + 7-h(5 logpoo 1 + 7 +(5 logpoo 


We first consider the case s = t. Note that, with definitions (4.2.11) and (4.2.13), 


^ log(POo/gig2) 
log(l/poo) 


log -I- log 92 2 -I- 7 -I- d 


log Poo 

1 


1 + 7 + (5 


1 + 7 + <^ 

- h{Pn) -9{pii) 


+ 


1 


1 + 7 + (5 
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and that, since h^g > 0 by Lemma 4.7 (i) and (ii), we thus have A*(s) < A* ^(s). Hence, 


roo 


V{») - \;4s) d. - (h + g) g-i, < 


again by Lemma 4.7 (i) and (ii). For s < t, note that 


X*{s,t) = 


7 


+ h 


l + <5 


-h 


'y ^ 1 d” ^ 4 - 1 /1 ^ 

e“ 1 + 7 +^ e~ i+ 7+'5 


'y gg 1 4- 1 / I ^ 

1+7+d 1+7+5 


[1 + 7 + 5 J [1 + 7 + 5 

where A*(s, t) and A* ^(s, t) are defined by (4.2.11) and (4.2.13), respectively. Thereby, 


A*(s,t)-A;5(s,t)e^(*-^) 


1 + 5-7 


where 


TT+ 


1 + 7 + 5 
< h + h?. Note that we have 


h-h^ 


0 i+7+a 1+7+5 *0^.7-^)^ 


X*{s,t) - A; 5 (s,t)| < |A*(s,t) - A;,(s,f)e'^(‘-^)| + A;,(s,t) - 1 


We first compute the following integral: 


ri 1+^ ,11.^, /■* l°s(poo/<;2) „ l°sg2 ^ 


■ 1+7+5 * 1+7+5 ^^dsdf = 


POO Pt 

J u* J u* 


0 log POO log POO dsdf = 


log POO 
log q 2 


where, using z < — log(l — z) < for all \z\ < 1, and (4.2.12), 


< 


1 + 7 + 5 


log Poo ^ -log[l- (1 + 7 + 5)pii] 
logg '2 -log[l - (1 + 5)pii] “ (1 + 5)[1 - (1+ 7 + 5)pii] 
1 + 7 + <5 f, , (1 + 7 + 5 )pii 


< 

- 1 + 5 

Moreover, by Lemma 4.7 (i), 

7Pii 


1 + 


[1 - (1 + 7 + 5)pii]^ 


h + h^ < 


+ 


7Pii 


< 


27Pii 


1-(1 + 7 + 5)pii ' [1 - (1+ 7 + 5)piiJ [1 - (1 + 7 + 5)pii]2’ 
since 7 pii = pio < 1, and therefore ( 7 Pii)^ < 7 Pii. Then, 

27(1 + 7 + 5)pii _„* 


n t 

A*(5,f)-A;,(s,f)e^(*-^) 

* 


dsdf < 


(4.2.23) 


(4.2.24) 


(4.2.25) 


(4.2.26) 


(1 + 5)[1 - (1+ 7 + 5)pii]3 

which gives a bound for the integral of the first error term in (4.2.23). For the second error term in 
(4.2.23), note first that “ 1| = — 1, since h >0 and t > s. By (4.2.24) and (4.2.25), and 

with definition (4.2.13) of A* ^(s, t), we obtain 

r " e-- 

Ju* Ju* (1 + 7 + < 5 ) log q2 


< 


7 


1 + 


(1 + 7 + 5)pii 


l + 7 + <Jl [1 - (1 + 7 + 5)pii]2 


(4.2.27) 
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whereas 

By (4.2.27) and (4.2.28), we may thus bound the integral of the second error term in (4.2.23) as 
follows: 


roo ft 


-^7,(SC'S) 




- 1 


dsdf < 


IPii 


Hence, for s <t, (4.2.23), (4.2.26) and (4.2.29) give 

n t 

|A*(s,f) - A* ,5(s,f)| dsdf < 


IPll 


[1 - (1 + ^ + 5)piiY 

2(1 + 7 + 5) 


[1 - (1 + 7 + (5)pii]^ 

By proceeding analogously for s > t, we obtain 


n s 

|A*(s,f) - A* ,5(s,f)| dMs 

.* 


< 


5pu 


l + <5 


2(1 + 7 + 6 ) 


[1 - (1 + 7 + (i)pii]3 \ 1 + 7 

The sum of the bounds for the three cases s = t, s < t and s > t yields 


(4.2.29) 


+ 1 > e 


+ 1 > e 


[ \x^idz)-x;4dz)\ 

J A* 

< __/ 7(1+ 7 +<^(1 + 7 + <^) , p-n* < 4(1 + 7 + (i)Vi . 

“ [1 - (1 + 7 + (i)pii]3 \ 1 + (5 1 + 7 J “ [1 - (1 + 7 + (5)pii]3 

where we used (1 + 7 )“^, (1 + <5)“^ < 1, and ^ + 5 < 1 + 7 + 5 < (1 + 7 + 5)^ for the second 
inequality. 

(ii) Direct computations give A*(yl*) = e““* = A* ^(,4*). Theorem 4.11 (ii), together with the 
bound from (i), then immediately gives the result. □ 


The error bounds established in Proposition 4.12 are similar to the error bound from Proposition 
4.10. As before, pun needs to converge to 0 fast enough to make up for the factor which 
increases the size of fhe error as soon as tt* < 0. And since > 0 gives 1 or no points in A*, the 
mean number of points in A* being given by for either process, we would want the threshold 

tt* to be negative. 

The biggest difference between the (i 2 -bounds from Propositions 4.10 and 4.12 is that the for¬ 
mer contains the multiplicative factor [1 — (1 + 7 + 5)piin]“^ and the latter the bigger factor 
[1—(l+7+5)piin]“^. However, since we need piin —)■ Oasn—)• 00 , we willhave (l+7+5)pii„ < 1/2 
for all n large enough. Then [1 — (1 + 7 + 5)piin]~^ < 2[1 — (1 + 7 + 5)piin]~‘^ so that both error 
bounds will be of fhe same rate. Hence, for large enough n, fhe approximation by a further Pois¬ 
son process does not add an error of a bigger size than the one that arises from the approximation 
by onlyPRM(A*). 
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4.2.6 Final bound in the (i 2 -distance 


The following corollary summarises the results from Sections 4.2.1, 4.2.4 and 4.2.5. It gives an 
estimate for fhe error in fhe (^ 2 -distance of the approximation of fhe law of an MPPE with 
i.i.d. Marshall-Olkin geometric marks, living on a lattice of points contained in A* n [— log n, oo)^, 
by the law of a Poisson process wifh a continuous intensity measure A* ^ over A* n [— log n, oo)^, 
where A* = [u*, oo)^ for some choice of fhreshold u* > — log n. 


Corollary 4.13. Let pn^ G (0,1) and assume that qi^, q 2 n poon satisfy (4.2.12). For any choice of 
vf> — log n, define A* = [tt*, oo)^. With the notations from Sections 2.4 and 4.2.1-4.2.3, 


d2 (£(h^*),pRM(a;,)) < 


e {1 + 7 + S)^Pnn 

n [1 - (1 7 + d)piin]^ 


|\/2-|-Tmin |e 1.65e . 


Proof. We have 


d2 {C(E%),PRM{\ls)) 

< d2 PRM(7r*)) + d2 (PRM(7r*), PRM(A*)) + d2 (PRM(A*), PRM(A;^5)) . 


By Theorem 4.3, 

0 

d2 (£(H:^*),PRM(7r*)) < dTV {C{E\.),PRM{7 v*)) < -. 

n 

Furthermore, with the results from Propositions 4.10 and 4.12, and using (1 + 7 + 5) < (1 + 7 + 5)^ 
and [1 — (1 + 7 + 5)pii]“^ < [1 — (1 + 7 + 5)pii]“^, we obtain 


d 2 (PRM(7r*),PRM(A*)) + ^2 (PRM(A*), PRM(A*_ 5 )) 

(1 + 7 + 5 )^Plln 


< 


[1 - (1 + 7 + 5 )piln]^ 


|\/2 + 7min|e 1.65e . 


□ 


By far the smallest component of the error estimate from Corollary 4.13 is given by the 

error arising from approximating ) by PRM(EH^*), which lives on the lattice A* n E* just as 
. A far bigger error emerges for fhe MPPE with Marshall-Olkin geometric marks when going 
from the Poisson process on the lattice to a Poisson process on A* n [— log n, 00 )^ with continuous 
intensity. This error can only be small if the probability of simultaneous success of fhe Marshall- 
Olkin geometric distribution, pn, and thereby also the marginal success probabilities 1 — qin and 
1 — Q 2 n, tend to zero as n —)■ 00 at a rate fast enough to compensate for the factor the (rough) 
number of poinfs expected in A* for each of the processes. Por instance, for A* = [— log log n, 00)^ 
and piin = 1/n, we expect log n joint threshold exceedances, and obtain 


d2 {C{E%),PRM{Xfs)) 


log n 

< + 

n 


(1 + 7 + 5)^ 


n[l - (1 + 7 + 5)/n]3 


|\/2 + 7min |logn, 1.65Y^logn|| 


< 


Clogn 


n 
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where C is some constant. With the (very strong) condition = 1 /n, we thus obtain an error of 
the same size as the error that we obtain when approximating only by PRM(EH^*). 
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